The Prosody Module
نویسندگان
چکیده
We describe the acoustic-prosodic and syntactic-prosodic annotation and classification of boundaries, accents and sentence mood integrated in the Verbmobil system for the three languages German, English, and Japanese. For the acoustic-prosodic classification, a large feature vector with normalized prosodic features is used. For the three languages, a multilingual prosody module was developed that reduces memory requirement considerably, compared to three monolingual modules. For classification, neural networks and statistic language models are used.
منابع مشابه
Improved Prosody Module in a Text-to-Speech System
The newly-developed prosody module of our text-to-speech (TTS) system is described in the paper. We present two main works on it’s establishment and improvement. On the basis of potential factors influencing prosody parameters, inclusive of duration, pitch and intensity, the prosody model is built as groundwork of this module which is superior to the former rule-based one in generation of natur...
متن کاملComparing Prosody Formalisms for Machine Learning
We need to find the most suitable prosody formalism for the task of machine learning. The target application is a prosody generative module for text-to-speech synthesis. This module will learn prosody marks (parameters or symbols) from large corpora. Formalism we are looking for should be general, perceptually relevant, restorable, automatically obtained, objective and learnable. Main formalism...
متن کاملSynthesis of Spoken Messages from Semantic Representations. Semantic-Representation-to-Speech System
A semantic-representation-to-speech system communicates orally the information given in a semantic representation. Such a system must Integrate a text generation module, a phonetic conversion module, a prosodic module and a speech synthesizer We wil l see how the syntactic information elaborated by the text generatlon module is used for both phonetic conversion and prosody, so as to produce the...
متن کاملProminence-Based Prosody Prediction for Unit Selection Speech Synthesis
This paper describes the development and evaluation of a prosody prediction module for unit selection speech synthesis that is based on the notion of perceptual prominence. We outline the design principles of the module and describe its implementation in the Bonn Open Synthesis System (BOSS). Moreover, we report results of perception experiments that have been conducted in order to evaluate pro...
متن کاملFeedback loop for prosody prediction in concatenative speech synthesis
We propose a method for concatenative speech synthesis that permits to obtain a better matching between the logF0 and duration predicted by the prosody module and the waveform generation back-end. The proposed method is based upon our previous multilevel parametric F0 model and Toshiba’s plural unit selection and fusion synthesizer. The method adds a feedback loop from the back-end into the pro...
متن کاملA Thematicity-Based Prosody Enrichment Tool for CTS
This paper presents a demonstration of a stochastic prosody tool for enrichment of synthesized speech using SSML prosody tags applied over hierarchical thematicity spans in the context of a CTS application. The motivation for using hierarchical thematicity is exemplified, together with the capabilities of the module to generate a variety of SSML prosody tags within a controlled range of values ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006